Everything About Transformers
krupadave.com·2d
👁️Attention Optimization
Flag this post
A Minimal Route to Transformer Attention
neelsomaniblog.com·2d·
Discuss: Hacker News
👁️Attention Optimization
Flag this post
[D] Best (free) courses on neural networks
reddit.com·3h·
👁️Attention Optimization
Flag this post
Dual-format attentional template during preparation in human visual cortex
elifesciences.org·3d
Flash Attention
Flag this post
An underqualified reading list about the transformer architecture
fvictorio.github.io·2d·
Discuss: Hacker News
Flash Attention
Flag this post
GIR-Bench: Versatile Benchmark for Generating Images with Reasoning
paperium.net·1d·
Discuss: DEV
🏎️TensorRT
Flag this post
**Breaking the Curse of Dimensionality: A Game-Changer for L
dev.to·1d·
Discuss: DEV
👁️Attention Optimization
Flag this post
Specialized structure of neural population codes in parietal cortex outputs
nature.com·1d
Flash Attention
Flag this post
Sparse Adaptive Attention “MoE”: How I Solved OpenAI’s $650B Problem With a £700 GPU
medium.com·4d·
Flash Attention
Flag this post
Everything About Transformers
krupadave.com·2d·
👁️Attention Optimization
Flag this post
RF-DETR Under the Hood: The Insights of a Real-Time Transformer Detection
towardsdatascience.com·1d
👁️Attention Optimization
Flag this post
Minimax pre-training lead explains why no linear attention
reddit.com·2d·
Discuss: r/LocalLLaMA
Flash Attention
Flag this post
Your Transformer is Secretly an EOT Solver
elonlit.com·1d·
Discuss: Hacker News
👁️Attention Optimization
Flag this post
Show HN: Hot or Slop – Visual Turing test on how well humans detect AI images
hotorslop.com·1d·
Discuss: Hacker News
👁️Attention Optimization
Flag this post
Why AI companions exploit the same psychology as teddy bears
lightcapai.medium.com·26m·
Discuss: Hacker News
📊Gradient Accumulation
Flag this post
🧠 Soft Architecture (Part B): Emotional Timers and the Code of Care (Part 5 of the SaijinOS series)
dev.to·6h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
Evidence on language model consciousness
lesswrong.com·15h
🏎️TensorRT
Flag this post
The Kinetics of Reasoning: How Chain-of-Thought Shapes Learning in Transformers?
arxiv.org·1d
🏎️TensorRT
Flag this post
Show HN: Free unlimited AI video animation with no daily limits or signups
animateforever.com·6h·
Discuss: Hacker News
👁️Attention Optimization
Flag this post